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Abstract 

During an infection, HIV experiences strong selection by immune system T cells. Re- 
cent experimental work has shown that MHC escape mutations form an important path- 
way for HIV to avoid such selection. In this paper, we study a model of MHC escape 
mutation. The model is a predator-prey model with two prey, composed of two HIV vari- 
ants, and one predator, the immune system CD8 cells. We assume that one HIV variant 
is visible to CD8 cells and one is not. The model takes the form of a system of stochastic 
differential equations. Motivated by well-known results concerning the short life-cycle of 
HIV intrahost, we assume that HIV population dynamics occur on a faster time scale then 
CD8 population dynamics. This separation of time scales allows us to analyze our model 
using an asymptotic approach. 

Using this model we study the impact of an MHC escape mutation on the population 
dynamics and genetic evolution of the intrahost HIV population. From the perspective 
of population dynamics, we show that the competition between the visible and invisible 
HIV variants can reach steady states in which either a single variant exists or in which 
coexistence occurs depending on the parameter regime. We show that in some parameter 
regimes the end state of the system is stochastic. From a genetics perspective, we study 
the impact of the population dynamics on the lineages of HIV samples taken after an 
escape mutation occurs. We show that the lineages go through severe bottlenecks and that 
the lineage distribution can be characterized by a Kingman coalescent. 

1 Introduction 

During HIV infection, HIV and the immune system T cell populations form a complex and 
dynamic coupled system. Many authors have modeled and analyzed this interaction, e.g. 



1 18: Till . In such work, authors usually consider the interaction of HIV, CD4 T cells, and 
CD8 T cells through deterministic predator-prey ODE models. In this paper, our biological 
motivation is to consider the effect of MHC escape mutations (described biologically below) 
on the genetic diversity and population dynamics of the infecting HIV population. Mathe- 
matically, in order to examine these biological issues, we extend the typical ODE models by 
including stochastic effects in the population dynamics and considering lineages of infected 
HIV cells as one looks backward in time. More specifically, we change the typical ODE 
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models to stochastic differential equations (SDE) and consider a coalescent process of HIV 
evolution on top of the population dynamics. 

We consider a model of HIV-CD8 interaction that focuses on so-called MHC escape mu- 
tations. Roughly speaking, when HIV enters a CD4 cell certain mechanisms within the cell 
cut up HIV proteins into small pieces (usually 8-11 amino acids long) and present these 
pieces on the surface of the cell. This presentation is accomplished by the binding of the viral 
pieces to so-called MHC I molecules to form a peptide-MHC complex (pMHC) |4J]. For our 
purposes and to simplify the explanation, the pMHC complex can be thought of as represent- 
ing a certain short nucleotide sequence in the HIV genome. CD8 cells are equipped with T 
cell receptors (TCRs) that can bind to a pMHC complex and then destroy the presenting cell. 
Critically, each TCR binds to a limited pattern of nucleotide sequences Jj]. In this sense, 
since each CD8 cell has only one type of TCR, we can think of each CD8 as targeting some 
short segment of the viral genome. 

Recently, there has been much experimental and statistical work on mutations in HIV that 



avoid MHC I presentation (e.g. II It I5t l8c 1 17c 12 011) an d many authors have suggested that such 
mutations play a key role in HIV dynamics 0] ■ The MHC I molecule cannot present 

every type of nucleotide sequence [4], and it is possible for the virus to mutate and evade 
MHC presentation. In such a case, a mutation (or series of mutations) will render the virus 
invisible to the CD8 cell that was previously able to attack virus infected cells. 

We consider a simple model of such an MHC escape mutation. We assume that initially 
all HIV infected cells are subject to attack by a collection of CD8 cells with a shared TCR. 
We refer to CD4 cells infected by HIV variants that are visible to these CD8 cells as wild 
type cells. Then, we assume that a single infected cell changes in its HIV genetic state and 
becomes invisible to the CD8 cell attack, we refer to this new type of infected cell as the 
mutant type. Our motivation centers on understanding the population dynamics and genetic 
evolution caused by escape mutations that occur during the chronic stage of HIV when CD8 
and HIV population sizes are relatively stable 0|. In terms of population dynamics, we 
are interested in whether the mutant type survives, the wild type survives, or both. From 
a genetics perspective, the escape mutation reflects a change in the HIV genome at a given 
location, but we are interested in the effect of the resultant population dynamics on other 
parts of the genome. As an analogy, strong selective sweeps, for example, may be caused 
by mutations at a given point in the genome, but other areas of the genome are affected by 
the sweep HOP . In some sense, our model represents a complex selective sweep involving 
three players and an uncertain final outcome. We would like to understand the impact of 
this complex selective sweep on intrahost HIV genetic diversity. In this paper we will ignore 
recombination. 

We specify birth and death rates for the CD8, wild type, and mutant type cells and con- 
sider the associated coupled birth-death processes. However, rather than consider the birth- 
death processes directly, we analyze an associated SDE system. A critical feature of our 
analysis is a separation of time scales. HIV has been shown to evolve on a time scale of hours 
to days, while the CD8 cells evolve on a time scale of days to weeks lUlll . Technically, this 
means that the CD8 cells' birth-death process has much lower rates than the wild and mutant 
type birth-death processes. We use this separation of time scale to apply an asymptotic anal- 
ysis to the associated SDE system. We analyze the SDE system in a large population limit 
that takes the number of HIV infected cells to infinity. Since the HIV virion population size 
is on the order of 10 9 11 ill , this limit is appropriate. 

We show that in a certain range of time scale separation, stochastic effects become very 
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important. Our results demonstrate that in certain parameter regimes one of three events 
occurs with probability one: the mutant type may be lost, the wild type may be lost, or 
coexistence may occur. Further, we show that in other parameter regimes, the probability of 
each of these three events is strictly between zero and one. These results contrast with results 
for the deterministic analogue to our SDE system in which coexistence always occurs. The 
degree of the time scale separation connects to the rate of CD8 cell response and so our results 
show that the end result of MHC escape mutations depends heavily on immune system speed. 

We consider questions of genetics diversity by considering the lineages of a sample of 
infected cells taken after the escape mutation occurs. We show that the relationship between 
the number of samples taken some time after the mutation and the number of corresponding 
lineages that^ exist immediately before the mutation can be described through a Kingman 
coalescent [6]. An open question that has received considerable attention is the effective 
size of HIV during infection. Effective size is a way of baselining the genetic evolution of 
a population by comparing it to the genetic evolution of classical Wright-Fisher populations 
Q6Q . Our results demonstrate that the effective size of HIV is heavily influenced by bottlenecks 
that occur during MHC escape mutation. We show that the magnitude of these bottlenecks 
depends on the rate of CD8 cell response and that in this sense, the effective size of HIV is 
affected by the speed of the immune system. 

2 The Model 

We let v(f), v*(t),p(t) be the number of wild type infected cells, mutant type infected cells, 
and CD8 cells targeting the wild type infected cells respectively (here p stands for predator). 
We assume the dynamics of these populations are given by a birth-death process with the 
following rates. 
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The parameters k, k* , Ak, Ak* , h represent baseline birth and death rates for each of the cell 
types when interaction between the cell types can be ignored. Note that we set the birth and 
death rates of p equal because CD8 cells require antigenic stimulation to expand in number, 
on the other hand we can assume that the HIV birth rate exceeds the HIV death rate for both 
mutant and wild type cells. The parameter a measures the rate of CD8 cell killing of wild 
type cells, c is a logistic growth factor representing competition between infected cells for 
uninfected cells. Finally b represents the rate of CD8 expansion in the presence of wild type 
antigen, while d represents a logistic growth factor corresponding to competition between 
CD8 cells for antigenic stimulation. 

2.1 The Approximating SDE 

InH, Kurtz described the connections between birth-death processes and an approximating 
SDE. For HIV, v, v* and p are all of enormous order. If we rescale the system to makes these 
variables O ( 1 ) we can arrive at the following SDE system which approximates the birth-death 
processes of v,v*,p. In ( 12.11 ), V is the order of the infected cell population size and v, v* and 
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p represent the now rescaled population variables (see the appendix for a precise description 
of the rescaling). 

dv = v(l - (v + v*) - P )dt +\j V{k+ (V ^ (r), (2.1) 

dv* = v*(f- ( v + v *))dt + ^^ + ^' + V * )) dB 2 (t), 
dp = Bp(v— ap)dt, 

where £ = f,OC = We set wfa) = (v(f), v*(*), /?(*)). 

A theorem of Kurtz, see 11411 . says that such approximations becomes exact as the scaling 
factor, our V, goes to infinity. However, our system is not exactly of Kurtz's form as our 
parameters will be scaled with V, i.e. e will be taken <5(^|^). We have not pursued this 
technical issue, rather from this point on we take (12.11 ) as our description of the evolution 
of u and no longer consider the birth death processes. Although this is an approximation, 
we believe that our results will hold for the birth-death processes as well. Further, we point 
out that the birth-death process is an approximation of underlying dynamics, so to a certain 
extent considering ( 12. II ) is no worse than considering the birth-death process. Finally, notice 
that there is no stochastic term in the p equation. There should be one, but we have dropped 
it. This will have no effect on our results and simplifies the explanation (see section [5] for a 
more precise justification). 

We assume that the escape mutation arises at time t = 0, and that previously v* = and the 
system ( 12.11 ), restricted to v,p, is in equilibrium. This assumption of equilibrium corresponds 
to our interest in the chronic stage of HIV infection. More precisely, we take 

v(0) = T ^, AO) = 1, ,(0) = ^ (2.2) 

In ( 12.lt the absolute fitness of v, in the absence of CD8 cell effects, is 1 while the fitness 
of v* is /. Typically, in order to avoid immune system attack, escape mutants are less fit than 
the original wild type and so we take / < 1. We contrast absolute fitness with CD8 influenced 
fitness. If CD8 cell attack is considered, the fitness of v and v* are 1 — p and / respectively. 
If 1 — p(0) > f then the mutant is initially less fit than the wild type and the dynamics are 
not interesting, indeed the mutant will simply quickly die out. We restrict our attention to the 
interesting case of 1 — p(Q) < f. Using ( 12.21 ) this translates to /— Of (1 — /) > and we will 
assume this condition throughout the rest of this paper. 

We will consider ( 12.11 ) in the limit V— > °° with e = O(^iy) over a time interval [0, 0{^)]. 
The e scaling will be shown to be the correct scaling to see 0(1) stochastic effects, and the 
time interval scaling is the length of time the system needs to be guaranteed to return to an 
equilibrium after the escape mutation arises. 

Associated with (12.11 ) is the deterministic analogue in which the stochastic terms are sim- 
ply dropped. 

v = v(l-p-v-v*) (2.3) 
r = v*(f-v-v*) 
P = ep(v — ap) 



4 



with, 

v-CO)^, P(0)^ (2.4) 

We set 5(f) = (v(f),v*(f),/?(f)). Throughout this paper, we useabarto distinguish a variable 
associated with the deterministic system ( 12. 3t from the corresponding variable associated 
with the stochastic system (12. 11 1. 

2.2 Decomposition of Oscillations 

The deterministic system ( 12.3b produces oscillatory dynamics as the system moves from the 
original equilibrium of u = ( , , ) to the new equilibrium of h = (a (1 — /),/ — Ot ( 1 — 
/), 1 — /). Figure[T]shows an explicit solution of (12.31 l for e = .01, a = 1,/ = .8. 




Figure 1: Solution of (lOt fore = .01, a = 1,/ = .8 up to time O(^) 



The oscillatory dynamics result from the time scale separation. Figure[2]provides a zoom 
in of a first oscillation in the dynamics of u. Immediately after the escape mutation arises, at 
t = 0, there is an initial stage of the dynamics in which v, p are held relatively fixed while v* 
rises. This initial stage, which in Figure|2]ends at t = T s , occurs only immediately after t = 
and is not repeated. After the initial stage ends at time T s , we separate the oscillation into 
four stages which form a full cycle: stage I delimited by [T s , Tj], stage II delimited by [?/, Tn], 
stage III delimited by [Tjj, Tju], and stage IV delimited by [7///,7/y]- I n Stage I, v collapses 
to o(l) levels while v* rises to 0(1) levels. This occurs because the CD8 influenced fitness 
of v, given by 1 — p, is less than / during Stage I. In Stage II v stays at o(l) levels, but p 
drops until v becomes more fit than v* . When p = 1 — /, v and v* are equally fit, indeed at 
this point v reaches its minimum. After that time v rises until v(Tjj) = v(Tj). In Stage III, v* 
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is less fit than v and v* collapses to o(l) levels. In Stage IV, p rises until v* becomes more 
fit than v which causes v* to rise until v*(7/y) = v*{fni). At the end of Stage IV the system 
has returned to the situation of time T s and the cycle repeats. We refer to the Stages I-IV as 
a cycle, and so the dynamics of u and u are formed by a sequence of cycles. As Figure Q] 
shows, with each cycle the strength of the oscillations is damped. The stochastic system ( 12. Il l 
has identical stages with 7}, 7}/, 7}//, 7}/ defined analogously to 7}, 7#, Tjn, Tu, T s is used for 
both ( 12. U and ( 12.31 ). The definitions of r s , 7/, 7//, f///, 7)/ will be made precise in section [SJ 
for now we simply provide the reader with an intuition for the dynamics. 




Figure 2: Solution of d2.31 > for £ = .005, a = 1,/= .8 showing only the first cycle in the 
dynamics of u 



We will show that d2.ll ) behaves essentially as (12.3b . The exception will be during times 
in a small subinterval in each of stages II and IV during which v and v* are at a minimum. If 
£ is too small, then at these subintervals v or v* will be driven to zero by the stochastic terms 
in ( 12. jj . If £ is too large, then at these subintervals, ( 12.11 ) will behave as (12.31 ) and stochastic 
effects can be ignored. However, if £ is at the appropriate scaling, namely 0(^|y), then we 
will show that v, in stage II, and v*, in stage IV, behave like Feller diffusions. 

The damping of the oscillations will be crucial to our analysis. Stochastic effects play an 
important role when v and v* are at their minima. The damping of the oscillations means that 
the minima become less extreme with each passing cycle. Consequently, as we shall show, if 
v or v* are not lost during the first cycle, then ( 12.11 ) will reduce to ( 12.3b and stochastic effects 
can be ignored for the rest of the considered time interval. 

From a genetic perspective, the behavior of v and v* during Stages II and IV correspond 
to severe bottlenecks. We will explore the effects of these bottlenecks on the lineages of a set 
of samples taken at a time 0(4r) ti me units after the escape mutation occurs. 
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3 Results 



We are interested in determining the probability of wild type loss, mutant type loss, or co- 
existence of the two types. If the wild type is lost, then the steady state of ( 12. It is given by 
um = (0,/,0). If the mutant is lost then the steady state is uw = ( y-^j , 0, jt^)- And finally 
if coexistence occurs then the steady state is uq = (oc(l — /),/— a(l — /), 1 — /). 

We further discern between two types of mutant loss. The mutant may be lost immediately 
after the initial mutation occurs, that is before the mutant population reaches a significant 
proportion of the virus population (this will be made precise shortly). We refer to this as 
failed mutant dynamics. Or, the mutant may rise to significant population levels but then 
subsequently be lost. We refer to this as the lost mutant dynamics. 

Our first result characterizes the probability of these events occurring in the limit of V — > 
°° and the scaling e = O ( ) . We consider the system up to time t f such that tf = O ( Jj- ) and 
we show that by time tf the system is arbitrarily close to a steady state. Below, when we write 
P(u (ui,U2,ut,)) = p we mean that for any fixed constant c, we have limy^ m P(\\u(t f) — 
(Ml,M2,M3)||oo < c) =p. 

In order to state our result we need the following definition, 



-1 



(«,/) = 



log(- 



(/-«(! _/)) + log( 



1 



(!+«)(!-/)) 



(3.1) 



aH 



where H solves the following equality, 



(3.3) 



Notice that H is a function of / and a although we do not make this dependence explicit. 



logV 



Theorem 1. Set e 



will mean, sup,/ <rf V*(r) < £ 



and tf = 4j where j3 and t are held fixed. By a failed mutant we 



If 


Then in the limit V — > °° 


P ~ 


P(u ~» Um) = 1 - Pfailed 
P(u ~» Uw) = Pfailed 


^f f) < 1 and v ""f' f) > 1 


P(u uw Jailed mutant) = p failed 
P{u ~> uw, lost mutant) = 1 — Pf a u e d 


*i'»f J1 < 1 and v "^- n < 1 


P(u w Uc) = 1 - Pfailed 
P(u -^U W )= Pfailed 



where, 



Pfailed = exp[- 



4(/-«(l-/)) 1 



(3.4) 



(k* + l)(a + l)' 

Figure [3] graphically displays the results of Theorem Q] (the case of a failed mutant is 
ignored). The relationship between 0i; m and i//i; m seen in Figure[3]is general with 0]; m always 
increasing and diverging at / = 1 and y/hm always possessing a single maximum and equaling 
zero at the two possible endpoints of /: and 1 . We see that in the scaling of TheoremU 
the end state of (12. U is completely deterministic. 
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If we allow /3 to scale with V we can analyze the boundaries of the results given in 
Theorem[T] Define w(t) to be the solution of the following Feller diffusion at time t : 

dw = y/wdB. (3.5) 
w(0) = 1. 

Let f(a) be the solution to 

<km(a,f) = Wiim(aJ)- (3.6) 
Theorem 2. Set f = f and <j>n m = $u m (<X,f). Set, 

1 iloglogV log(jCv^ii 



£ -^'"[lo g Y + VgV) 2 (logV)^ i3 ' ?) 



andtf = \, Define, 



2,71 cc 

r -=ll«W (H1)K ' (18) 



2n .1/ l + o(l+H) \ u , .« 



V (!-/)(/-«(!-/)) wy 7 / V(l+«)«(l+^), 
where rj/y « fl random variable with distribution, 

T}iV=w[T wad ]. (3.9) 

Further set, 

p w =P(w[T wild ]=0), (3.10) 

p M = f (w[r muMnf ] = o) 
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Then in the limit o/V — ► °°, 

P(u uw Jailed mutant) — pfaiied (3.11) 
P(u ~* u w , mutant lost) = (1 - pf a u e d){l - Pw)Pm 
P(u ~> u M ) = (1 - Pf a u ed ) p w 
P{u -w Mc ) = (1 - pf aUed )(\ - p w ){l - Pm) 

Our techniques allow us to derive similar results for the cases f < f and f > f. Notice 
that in Theorem [2] the end state of ( 12. U is truly stochastic. The scaling of e in Theorem |2] 
shows that the stochastic regime of ( 12.11 ) scales as O(jjiy) about (j). And indeed, the same 
can be shown to be true about y. 

We express our genetic results within the context of population sampling. To make things 
somewhat concrete, we assume that a sample of n infected cells is taken at time tf. The 
genetic composition of this sample of this sample across the HIV genome can be determined 
if one knows the lineages formed by these n samples and the mutations that occur on these 
lineages. The lineages and mutations associated with a sample are not deterministic and 
hence must be specified through a probability distribution. 

In this work, we do not determine the full distribution of the lineages and mutations 
associated with the n samples (although our methods should allow for this). Rather, we 
characterize the state of the lineages formed by the n samples at time 0. To explain this 
precisely, letyi ,y%, ■ ■ ■ ,y„ be labels for the n infected cells sampled at time tf. At time these 
n samples will have some number of ancestors, say «o, and we can arbitrarily label these 
ancestors z\ ,22, ■ ■ ■ ,z no - Each z, will be the ancestor of a certain number of the n sampled 

cells, let this number be B,-. Then we have B\ + B2 H hB„ = n since every sample y, must 

be descendant from some zj- Our results specify the distribution of (nQ,B\,B2, ■ ■ ■ ,B„ ). 

The Kingman coalescent is a specific probability distribution for the lineages of a set of 
samples corresponding to the lineage distribution in a classical Wright- Fisher population Jf^. 
For any time f, we let Yl(t;n) be the lineage distribution specied by the Kingman coalescent 
t time units prior to sampling of n individuals. The distribution of Yl(t;n) is well understood 
Jit I22I]. Our results will show that the distribution of («o,#i,#2, ■ ■ ■ >-^«o) ' s me same as tnat 
specified by the Kingman coalescent run for some time r gen etic. In this wa y> we show that our 
viral population run for time tf will have a similar level of genetic diversity as a Wright-Fisher 
population run for time r genet i c . 

Our results will actually split into cases depending on whether the wild type is lost, mu- 
tant type is lost, or coexistence occurs. In the case of one type surviving, we will be able 
to characterize the genetic diversity by a single Kingman coalescent started with n individu- 
als. However, when the types coexist, we need two Kingman coalescents to characterize the 
resultant distribution. 

The genetic results follow the same scaling regimes as specified in Theorems Q] and 
12 However, for clarity of presentation we simply state the results associated with the pa- 
rameter regime given in Theorem |2 The following theorem shows that the distribution of 
(no,B{,B2, ■ . . ,B n ) is a function of four random variables defined in the theorem: w\ , w%, £, %. 
We state the Theorem and then help the reader parse the results. 

Theorem 3. Let f,£,T w ud,T mutant ,pf a u e d be as in Theorem\2\ Let w\,W2 be independent 
versions of the Feller process w specified in ( 15.51 ) and let the Feller process in the definition of 
Tmutant be w\. Let £ be a uniform random variable on [0, 1] and let ^ be a binomial random 
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variable with n trails and success probability — j — . Define the times 7 ^genetic, l?^ ^genetic ,2 
according to the following table (in the cases where the entry for Tg enet \c.2 i s a ~> ow (y TgeneticA 
is defined.) 



If 


Tgene1ic.\> Tgenetic.2 


C < Pfailed 


0, - 


C > Pfailed,Wl[T wad ] = 


oo — 


C > Pfailed,Wi [T wiU ] ^ Q,W 2 [T mutant ) = 


T io ds Wl \r(k+i)s]' 


C > Pfailed, W\ [Tnild] 0,W2[T mut ant] ^ 





where T = y g ^ 1 ( t^j ) £• ^ n cases where Tg enet ic2 is not defined then, 

lim (no,B h B 2 ,...,B llo ) = Yl( ) (3.12) 

In the case where Tg e „ et j c 2 is defined then, 

lun (n ,Bi,B 2 ,...,B no ) = {H{T genetic x^),n(Tg enetic ^n- ^)). (3.13) 

Both of the above limits are meant in the sense of convergence in distribution. 

To explain Theorem [3] we explain the reasoning behind the table. Take first the case 
C < Pfailed- In this setting, the mutant fails to enter the viral population and essentially the 
population stays in v,p equilibrium. No bottlenecks occur and so the n lineages, run for 
time O(-^t) in a population of size 0(Y) will not converge. This is the same as running the 
Kingman coalescent for zero time. In the second case, £ > Pfaaedi w l Pwild] = 0, the wild 
type will be lost. In this case all lineages must come from the original escape mutant. This 
is the same as running the Kingman coalescent for infinite time to guarantee convergence of 
the lineages to a common ancestor. The third case, £ > ^failed, W\ [r w iid] ^ 0,W2[r w iid] = 0, 
corresponds to the loss of the mutant type. A bottle neck occurs in the wild types during 
Stage II and the mutant is lost during Stage IV. The bottleneck has an effect on the lineages 
equivalent to running a Kingman coalescent for 0(1) time. The expression for 7^ e netic,i gives 
the precise length of time the Kingman coalescent needs to be run, notice that r ge netic,i is 
stochastic. Finally, in the case £ > pfailed, wi [r w ;id] ^ 0,W2[r w ;id] ^ 0, both the wild and 
mutant types survive. In the new equilibrium, the fraction of wild type cells will be a ^7^\ 
so if we randomly sample at time tf, | gives the probability distribution of the number of 
wild types sampled. Working backward in time, all mutant samples come from the original 
mutant and hence must coalesce by time zero. r genet i c 2 reflects this, we run the Kingman 
coalescent infinite time for the part of the sample corresponding to mutant samples. 7g en etic,i 
represents wild type samples, and the bottleneck through which they pass in Stage II has an 
effect on the lineages equivalent to running a Kingman coalescent for time r gen etic,i. 

4 Discussion 

In this paper, through a specific model of MHC I escape mutation, we have attempted to 
emphasize the important interaction between population dynamics and genetic evolution. In- 
deed, as we have demonstrated, the initial escape mutation brings about oscillatory dynamics 
in the population sizes of the wild and mutant type cells. These oscillation then impact the 
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lineages of sampled cells, implying that selection by CD8 cells will affect genetic diversity 
over the whole viral genome. 

Several authors have explored the issue of intrahost HIV effective population size Jl5l : 



21j;ll3|]. Effective population size, as opposed to census population size, is a measure of the 



potential genetic diversity of a population. High effective population sizes imply, at least in 
the presence of significant mutation, high levels of genetic diversity. The HIV population is 
enormous and so one expects the effective population size to be large and, in turn, genetic 
diversity to be large. However, experimental data seems to suggest a small level of genetic 
diversity and hence a small effective population size II 151 . In this work, we have demonstrated 
that the notion of an effective population size for HIV in the presence of the predator-prey 
dynamics of CD8-HIV interaction is subtle (see |13[] for a qualitative discussion with similar 
observations). The intuition that effective population size and census population size are 
directly related is an artifact of considering classical Wright-Fisher populations, and HIV is 
certainly not such a population. Our results show that significant bottlenecks occur in the 
HIV population due to CD8 attack, and that the size and time lengths of these bottlenecks is 
determined not by the population size of HIV but by the rate of CD8 cell response. In our 
model e plays the role of CD8 response rate and V plays the role of HIV census population 
size. We find that if £ ^ log V then lineages do not coalesce over time 0{\). And, in fact, 
one can show that lineages will coalesce in time 0(V) which leads to an effective size of 
0(Y). However if e < 0(\og V) then lineages coalesce in time less than 0{\). In fact, one 

can show that the bottlenecks last for 0(-j=) time which leads to an effective size of 0(4|). 

In this paper we have not considered the mutation rate. While effective population size 
gives some idea of a population's genetic diversity, to fully model genetic evolution one needs 
to construct a model of lineage distribution, that is a coalescent, and analyze mutations on 
that lineage distribution. The strength of mutation then plays a central role in the pattern of 
genetic diversity. Data suggests that escape mutations fix or are lost in a matter of weeks. 
For a sample of size n the expected number of mutations that take place before the HIV 
population returns to equilibrium will be bounded by O(^) where jj. is the mutation rate 
of HIV. /x for a single nucleotide is on the order 10~ 5 , and in this setting we can safely 
assume that no mutations occur during the escape mutation. In such a case our results give 
a complete answer to the effect of our model on genetic diversity since we do not need to 
know what happens between the time of the escape mutation and the return to equilibrium. 
However, if one considers the whole genome, then is on the order of and we must 
account for mutations. Our approach should allow for such an analysis, but in this paper we 
have not pursued this important issue. 

The genetic diversity of HIV over the course of the chronic phase of infection has been 
shown to initially rise, eventually level off, and then finally drop [22]. The drop of HIV 
genetic diversity is correlated with the onset of AIDS. We speculate that one possible expla- 
nation for this evolution of diversity involves the collapse of the immune system. Our model 
suggests that a fast immune system will allow for more diversity. Indeed, when our e is 
large, Theorem[3]with K — > shows that the lineages do not coalesce and so genetic diversity 
increases with time. But, as CD4 cells counts collapse, we speculate that the rate of CD8 
response will also collapse and e will drop. From Theorem[3]we see, by taking K — > 00, that 
a small e forces lineages to coalesce and genetic diversity will drop. This line of reasoning 
is, of course, highly speculative since the loss of CD4 cells will impact CD8 response and 
HIV evolution in many ways that our model does not consider. And of course we are consid- 
ering CD8 attack at only a single epitope while HIV is typically subject to attack at multiple 
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epitopes. 



5 Proofs of Theorems 

In this section we prove Theorems Q] [2] and [3] As we mentioned in section [2] we analyze 
the stochastic system ( 12.11 ) by comparing it to deterministic system ((23). More precisely, in 
section [6] we consider the deterministic system in Stages I-IV. For each stage, we determine 
an asymptotic expansion in e of u at the end of the stage. So, for example, in stage II we 
assume 5(7}) as given and determine an expansion for 5(7//). In section|7]we consider the 
stochastic system through Stage I-IV. Roughly, the idea will be that in Stage I and III, u and u 
behave, with high probability, almost identically while in Stage II and IV, u and u differ due 
to stochastic effects. 

The main technical ideas needed to prove our theorems are contained in the lemmas of 
section|6]and|7] In this section we put these lemmas together to prove the theorems. By doing 
so, we hope to provide the reader with an underlying intuition for the more technical lemmas 
found in sections [6] and Q Before beginning in this task, we need to precisely define Stage 
I-IV and the initial stage. We set for the deterministic system, 

7} = inf{f >0: v(r) = £ c/ }, (5.1) 
T„ = mf{t > 7} : v(f) = e q }, 
f/// = inf{f>f//:v*(0=e 9 }, 
f/y=M{*>7}/,:v*(r)=£«}, 

and similarly for the stochastic system, 

7/ = inf{f >0:v(f) =e q }, (5.2) 

Tn = inf{f > Tj : v(f) = e q }, 
T m =mf{t>T n :v*(t )=<*}, 
T IV =mf{t>T II r.v*(t) = ei}, 

where q = 4. In section|2]we stated that in section I, v collapses to o(l) levels. What we 
meant, as seen from the definitions above, is that v collapses to the value of e q . Essentially, 
q is chosen so that when a variable falls below e q ', its effect will not be felt in the V — > °° 
limit. For example, in Stage II, v < e q and the result is that the dynamics of p, given by 
p = ep(v — ap) can be reduced to the integrable p — —zap 1 . Set, 

r s = inf{f : v* £ (0,e q )} (5.3) 

T s is considered only in the stochastic system and so T s — T s . Indeed, we will simply start the 
deterministic system at time T s by setting u(T s ) = u(T s ). 

We now proceed to prove Theorems Q][3] Theorems Q] and [2] have similar proofs, so 
we prove Theorem [2] and within that proof we comment on the connections to the proof of 
TheoremQ] 

Theorem\2\ To help the reader parse our arguments, we decompose this proof according to 
which stage we consider. At the end of the proof we consider a full cycle. 
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Initial Stage: 

We start by considering the initial stage. From ( 15. 3t we have three possibilities. v(T s ) = e q , 
v(T s ) = or r s = oo. Lemma lTTTl shows that P(T S = °°) = and rimv->oo.P(v(7' s ) = 0) = ^failed. 
So with probability pf a ii e d the stochastic system never exceeds £ c/ and we have a failed mutant. 
Lemma l77Tl also shows that v,p do not deviate beyond 0(e q ) from v(Q),p(0) by time T s , so if 
a failed mutant occurs, the system returns to state uw- 

Stage I: 

If the mutant does not fail, then Stage I starts. We take u(T s ) = u(T s ). Lemma 17721 gives for 
Stage I, 

P( sup ||«(0 - fi (0||. > \) < 0( Ctog^vy (logV)' (5 4) 

T s <t<T, VS 

and Lemma loTTl gives the asymptotic of u, up to 0(e), at the end of stage I. 
Stage II: 



Lemma 16721 shows that for the deterministic system v reaches an absolute minimum in Stage 
II. In section |7j we label that minimum as ^ and we are able to asymptotically compute its 
value as a function of p(T[). To emphasize this we write, ffiMSli, j n Lemmas 17.3117. 51 we 
define an interval [to,t{\ centered about ^MSIJ an( j f width J^. For our proofs to work, 
m can be taken as any value between | and |, but intuitively what matters is that the width 
of [*o,*i] be much greater than -^=. Lemma 17731 shows that on [7},fo], \u — u\ is bounded by 

O(eV) with probability 0(e 2 ). Lemma 17751 shows the identical conclusion for the interval 
[fi,7}/]. Lemma 17741 shows that inside the interval [fo,?i] stochastic effects matter and u may 
deviate from u. Indeed, Lemma 17741 gives. 

lim P(vis lost in [t ,h]) = j im P(w[V2n(k+l)Zu] =0) (5.5) 

where, 



v a(l-/) 21 Veiy/e ' ( ' 

The function <j> in (15.6b is defined in section |7J We caution the reader that 0i; m is not <$>, 
essentially <j) is log(4&) although we define <p somewhat differently for technical reasons. 
The connection between and 0i; m is given by the following relation which is justified in 
Lemma l774l 

-H'f ) = ^-?|loge|+log( r ^) + 0(et|logc|). (5.7) 
This relation leads to, 

e X p[- (*MZm ) ] = exp[^( T ^)(l + O( e i|log e |)). (5.8) 
Plugging ( 15.8b into ( 15.61 ) and using the scaling of e in Theorem[2]gives, 



lim 5// = J 1 (-g-)ic (5.9) 
v^»o y a(l -/) 2 1 + a 
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Plugging ( 15.91 ) into ( 15.51 ) gives 



lim P(v is lost in [f ,fi]) =P(w[r wM ] = 0) 
We note that in Theorem[TJthe scaling will force the following duality in S//, 



(5.10) 




(5.11) 




(5.12) 



and this is the essential difference between the results in Theorems[T]and|2] If v is lost during 
Stage II, it is straightforward to show that the system must go to %. 



If the wild type is not lost in Stage II, then by the same arguments used in Lemma |T2l to show 
that u ps u in Stage I, we can show that u sa u in Stage III (with the precise statement being 
identical to that found in ( 15.4b ). 



The arguments of Stage IV are almost identical to Stage II, except that v* is now what col- 
lapses rather than v. We define a function y in section [7] that plays the role (j) did in Stage II. 
And the relationship between iff and i//i; m is completely analogous to the relationship between 
<j) and 0ij m . If the mutant is lost then the system goes to uw- 

Behavior over a full cycle: 

If both the mutant and the wild type survive the first cycle of Stages I-IV then our claim is 
that the system gets arbitrarily close, at least to 0(1), to uq. This is a result of the damped 
oscillations seen in FigureQ] To demonstrate the damping of the oscillations and their impact 
on the probability of losing v or v* in a given cycle, we first note that by Lemmas |6 . 1 1 and [6731 
p changes by only 0(e|loge|) during Stages I and III. And since u and u are linked during 
those stages through d5.4t the same will be true of p and p. In Stage II, since v is 0(e q ) we 
will have p = -sap 2 + 0(e q+1 ). In Stage IV we will show that while v* < 0(e q ) we have 
v—(l—p) = 0(e). From this we have p = ep(l — (1 +a)p)+0(e 2 ). Both Stages II and IV 
can be shown to be of duration O(^) and so we can explicitly integrate p through the cycle 
with an error term 0(e). In this case we find < piTjy) — (1 — /) < p(T s ) — (1 — /) > 0. 
Essentially we have shown that the starting point of our cycle is damped towards (1 — /) in 
subsequent cycles. 

To connect to the probability of loss, consider the impact of this damping on 0i; m . 0u m 
can be thought of as a function of p(T s ) since ^( S "^ T ^ ) is, through this connection we can 
consider how 0ij m changes with p(T s ). Explicit differentiation gives, 



Stage III: 



Stage IV: 



dp(T s ) p 2 (T s ) 



<0 



(5.13) 
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So with each progressive cycle, 0i; m will be 0(1) smaller. The same can be said for \]f. To 
see the effect of this, let 0^ be the value of 0ii m corresponding to the first cycle, i.e. exactly 

(2) 

0ii m , and li]T | be the value of <j>n m corresponding to the second cycle. Then we have, 

, (2) ,(1) , . v ,(1) 
r 'lim l av „r 'lim l / ~ vrt [ s~if Ml ^ vrt [ ' li m " 



exp [l|s-] = rappjjSL] I exp[-0(~)M = exp[-^] ^exp[-0(logV)] j . (5.14) 

If we follow the arguments that led to ( 15.121 ) we see that if v is not lost in the first Stage II it 
will not be lost in subsequent stages, and similarly for v* . 

Now note that we set tj = 0(-^). We show in sections [6] and [7] that a full cycle takes 
O(^), so if v and v* are not lost we are considering O(^) cycles. Since we have shown that 
p(Tjv) — (1 — /) < p(T s ) — (1 — /) we see that with every cycle the next p(T s ) gets 0(1) 
closer to 1 — /. This implies that by the time interval [0(\),tf\, p — (1 — f) will be o(l). It 

is then easy to show that v and v* get within o(l) to their values in uq. And so u gets within 
o(l) of He- 

Finally, our analysis of the different stages contained expansions with error terms o(e) 
and excluded sets with probability 0(e 2 ). Since by time tj we have gone through at most 
O(i) cycles, if we now take V — * °° which takes e — * our arguments become true with 
probability approaching one. □ 

Before finishing with Theorems Q] and [2] we return to an issue raised in section [2] In that 
section, we stated that the stochastic terms involving p had been dropped in (12. j} , but that our 
results would be unaffected. Indeed, v and v* are only affected by stochastic terms when they 
drop to low levels in Stages II and IV. p never experiences such bottlenecks, so the stochastic 
terms in the p equation will have no effect. If we included them we would need a Lemma 
similar to Lemma I7T21 But this would just add technicalities to our discussion. 

We now prove Theorem[3] Before proceeding we explain how we build sample lineages 
on the population process u(t). These justifications are similar to arguments found in 12411 . 
Take two times t and t+At. Suppose at time t + At we have n' sample lineages of wild type. If 
a birth event happens in the underlying birth-death process during [t,t + At] then the number 
of lineages may drop from n' to n' — 1 . Indeed, if the new children formed by the birth event 
are both part of the n' samples at time t + At then we have a coalescent event and at time t 
there will be n' — 1 lineages. By symmetry, given a birth event there is a probability 7y~7jyj2 
that a coalescent will occur (recall that Yv(t) is the number of wild type cells at time f). 

To compute the probability of a coalescent event, we need to know the probability of a 
birth event in [t , t + At ] conditioned on the value of u (t + At ) . Set u (t + At ) — u — (v,v*,p). 
Then an application of Bayes rule gives 

p/i • ,i i ft ^ AA r , P(birth | u(t) = (v- 1 v*,p))P(u(t) = (v- ±,v*,p)) 

P(birth u(t+At) = u) = — . — * (5.15) 

v 1 v ' 1 P(u(t+At) = u) 

Vv-1 



P(u(t + At) = u | u(t) = (v- ^,v*,p)) 
= Vv-l + 0(Af). 



From these arguments we have that the probability of a coalescent event in [t,At] is 
"y"^ + Q( ^yL^ ) +0(At). We note that this approximation breaks down when Vv(f ) = 
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0(1). However, we only use these rate computations if the wild type is not lost, otherwise 



wild type lineages do not exist. And by the computations of Lemma 17.41 in this case we 
always have Vv = 0(-^=) so our expansions are valid. 

Finally, before proceeding to the proof of Theorem[3]we connect our lineage distribution 
to the Kingman coalescent. A well-known result in coalescent theory states that if lineages 
coalesce at rate, say, r(t) over an interval [0,T], then the lineages at time zero of this time 
varying coalescent process will have the same distribution as the Kingman coalescent run for 
time fjdsr(s) ll23ll . Essentially, in the Kingman coalescent lineages coalesce at rate 1, so 
by a time rescaling one can produce coalescent events at the correct rate, r(t). For us, the 
consequence of this observation is that the distribution of wild type lineages will be given by 
a Kingman coalescent run for time J f ds^^. Hence, in the proof of Theorem[3]we focus 

on the quantity Jo ds^rpj 

Proof of Theorem\3\ We recall the notation (no,B\ ,Z?2, ■ ■ ■ ,B„ ), introduced in section[3] rep- 
resenting the number of lineages left at time zero from n samples at time tr. We first consider 
the following cases: mutant fails, wild type loss, and mutant loss. 

If the mutant fails then Lemma|7?T] shows that v(f) = jf^ + 0(e q ) for all t S [0,f/]. Then 
we have, 

f'f 1 1 
lim / ds—— = lim 0(^— = (5.16) 

v-Wo Vv(j) v^»o Vv 

If the wild type is lost then, since there is only a single mutant at time 0, «o = 1 and 
B\ = n. This is the same distribution at II(°°;n). 

Now we consider the case of the mutant type being lost. First we claim, 

f'f 1 t*\ 1 

lim / ds—— = lim / ds—— (5.17) 

v^~y Vv'O) v^°% Vv(s) 

where recall that [?o,?i] is an subinterval in the Stage II in the first cycle of Stages I-IV. To see 
this, first note that in Stages I, EH, and IV and in the initial stage we have v > £ q . So since the 
duration of these stages is 0( |) we have, 

lim/" ^_!_ = Um 0(— L-) = 0. (5.18) 

On [Tjy,tf] the mutant is already lost and v(t) = 0(1), so limy^^ Jj^ r j ds^jr = 0. We are 
left to consider [Tj, 7}/] during the first cycle. The key relation is ( 17.681 ) in Lemma l7~4l which 
allows us to compute v(t) for t 6 [to,h]. By plugging in t = ^ in ( 17.681 ) we arrive at, 

v(r Hv(^)exp[a(l-/) 2 ^n] (5.19) 
We now wish to demonstrate the following bound, 

Tl dS V^)=L dS Y^) + ° { ^o) ) = i ds V^) il + 0i i )} ' 

(5.20) 



To see this first note that v is always bounded by 1. So at least for one time unit, v(f) is of 
order v(^-). Then we recall from the proof of Theorem[T]that v is well approximated by v in 
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Stage II outside of [fo^i]> an d that v is strictly decreasing on [7},fo] and strictly increasing on 
[fi,f//]. Since Stage II is of duration O(^) we must have, 

r 1 1 

JfrMA] S V*S) - SM¥) ■ (5 - 21) 

From the above bound and ( 15.191 . ( 15.201 follows. Summarizing, we have shown, 

ft/ 1 rti 1 

lim / ds—— = lim / ds——-. (5.22) 

v-Wo Vv(j) V-W*, Vv(j) 

Finally, LemmaElshows j£ dj^jj -> Y J ^[r(fc+i)i»1 • 

We have left the case of the wild type and mutant type both surviving. In this case, 
the computations are simply a combination of the wild type loss and mutant type loss case 
after we split the sample into wild and mutant samples. We have v(tf) = a(l — f) and 

v*(tf) =/— Ot(l — /). So the probability of drawing a wild type is, as V^°°, exactly a ^J^ 
and the number of wild types out of n sample is binomial with n trials and success probability 

«(!-/) ... 

/ ' U 

6 The Deterministic System 

In this section we consider the deterministic system ( 12.31 ). Our goal will be to find asymptotic 
expansions for u at the end of each of Stages I-IV and to develop estimate of the u dynamics 
during the stages as well. We define 5(f) = p(t) — (1 — /). 

6.1 Stage I 

Recall that Stage I is given by the interval [Tj,Tjj]. In this subsection, for notational conve- 
nience, we set 5 = 8(T S ). We assume v*(T s ) — e q , v(T s ) — (1 — p(T s )) < 0(e) (this assump- 
tion will connect to Lemma[7j]found m section|7] 

Lemma 6.1. T, - T s = 0(^|M) and, 

\f-t{T I )\=0{e«), (6.1) 

p(T,) = p(T s ) + ^^1a PI)1 + Ja/7,,2 + 0(e? |loge| 2 ) (6.2) 

where, 

Api,i=p(T s )(l - (2+a)p(T s )), (6.3) 
Api,2 =p(T s )(l - (l + a)K^))log(/) -o/(r,)log(l -p(T,)). 

Proof. We will separate [T s , 7/] into three intervals using stopping times T\ , T2 where, 

T 1 =M{t-T s :v*{t) = Ve}- (6.4) 
T 2 = inf{* - T s : v(t) = Ve} (6.5) 
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Our first task is to determine T\. Consider t £ [T s , 7}]. For such t we have, 

(1 - p(T s ) - v)= -v(l -p- v-v*) = -v(l - p(0) - v-v* - O(et)). (6.6) 

where we have used the fact that p(t) — p(T s ) < et. From d6.6l > we see that if (1 — p(T s ) — v) > 
0{max(et, v*,e)) then (1 - p(T s ) - v)< 0. Since (1 -p(r s ) - v(T s )) = 0(e) we can conclude, 
|l-p(r B )-v| = 0(max(£f,v*,e)). Using (ESI we have for f e [r s ,r s + (7i A0(|loge|))] 

v* = v*(/-v-v*) > v*(8 + 0(Ve) (6.7) 

From ( 16.71 ). we see that T\ = 0(\ loge|) and so we have exp[(5 + 0(Ve))Ti] = . Solving 
for T ] we arrive at T\ = | log £ | + ( yfe | log £ \ 2 ) . Integrating p we find, 

p(7i) = /5(r s ) + ep(r s )(l - (1 + a)p(T s ))T l + 0(e? |loge| 3 ) (6.8) 

)£l 
5 



: + {q 2) ! |1 ° g£| J p(7;)(l - (1 + a)p(T.)) + 0(ei |loge| 3 ) 



Now we consider the time interval [T S + T\,T S + T2\. Let r(f) = - p ^|| Then direct 
computation gives, 

f=(l-p(T s ))-fr+^(p-p(T s )). (6.9) 
Integrating through we have, 

r(t) =r(T s ) e xp[-ft] + (l-p(T s )) f dsexp[-f(t-s)] (6.10) 
' <fa«p[-/(f - s)} (p(s) - P(T s ))(^)(s) 
By differentiating log v + log v* we can arrive at, 

£(f) = ^(r s )exp[-5(f-r s ) + 0(ef)] (6.11) 
Plugging this relation into the second integral term in d6.10l > gives 

| j^e X p[-/(f-,)K;^)-p(^ 

(6.12) 

Noting that r(T s ) = 0(1), p(t)-p(T s ) < et and using ( l67T2b in ( RHOb with f = T s + T\ gives, 
r(r, + Ii) = ^ + 0(e^) (6.13) 

= l Z |W +0(Vi | logc| 3 ) _ 

Now again we use ( 16.10b . but instead of integrating from to t we integrate from T s + T\ to 
f € (T s + 7i , T s + T 2 ). In this case, since v* > y/e, 

dse^[-f(t-s)](p{s)-p{T t ))(^)(s)\<0(Ve\loge\). (6.14) 
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which leads to 

r(t)= l ~ P f (Ts) +0(Vt\loge\ 3 ). (6.15) 
Exploiting (16.151 allows us to integrate v explicitly. We have, 

v(r s + r 1 ) = (i-/3(r s ))-( 1 ~^ (rs) )V^+o(£|iog £ | 3 ). (6.16) 

v*(0 = / - ( - f -, T , Mt) + 0(e\ loge| 3 ). (6.17) 



Substituting these expressions into ( 12.31 ) gives, 



5 

-v 



(l-/3(r s ) + 0(e|loge| 3 ))-v). (6.18) 



i-p(r.) 

We can explicitly integrate (16 -18b - 

((i-p(r s )) 2 + o(V?))exp[-5( f -r 1 )] 



?(0 = 



v^ 1 ^ 1 ) + 0(e| loge| 2 ) + (1 -p(T s ) - M 1 - 2 ^ 1 ) + 0(e\ loge|2))exp[-5(r - 7\)] 



(6.19) 

Setting v(7i) = v 7 ^ an d solving for t gives, 

7 , 2-7 , i-^|loge|-^log(/(l-p(r s ))) + 0(^). (6.20) 
and explicit integration gives, 

Tl dsv(s) = 1 ~ ^ I log e| + 1 ~ piTs) log(/) + Q(Ve) (6.21) 
rj 2o o 



Using ( 16. 2U we find, 



ellogel/WXl-^r,)) 



p(r s + T 2 ) =p(T s + 7i) + -L^-L [ 2 - a P (T t ) ) (6.22) 

+ ^(p(T s )(l-p(T s ))log(f)-ap 2 (T s )log(f(l- /HIV, n j - (){e*) 

Finally we consider [T s + 72,7/]. The analysis in this case is almost identical to that of 
[T S ,T S + 7i] and we find: 7/ - T z = ^i|loge| + 0(y/e) and 



= p(T s + T 2 ) - [ap 2 {T s ){q I)) + 0(e* | loge| 3 ). (6.23) 

Combining our estimates of /5(r s + Ti),p(T s + T2),p(Tj) gives (16.2b . Finally we note that 
our arguments show that for all t G [T s ,Tj] we have v(f),v*(f) > e q . □ 
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6.2 Stage II 

From LemmaloTTIwe can assume 5(7}) > 0, v(f/) = £ q , and |v*(fj) - f\ = 0{e q ). 
Lemma 6.2. Let H u be the solution of (I - f)H n = ^log(l + ap(Ti)H u ). Then, 

\(T II -T I )-^-\ = 0(e"- 1 ) (6.24) 
1 + ap(Ti)Hii 

For t E [7}, 7}/] we have, 

sup|v*(0-/|<O(£ 9 ) (6.26) 
1 



v(f) = v(7})exp 



(l-/)(f-7»-_ log (1 + ea^(r/) (;-?})) +0(e 



Proof. We first show that on [7}, 7}/], p has simple dynamics. Indeed, for all t E [7}, 7}/], 

/3 = -eap 2 + Ce 1+ ' 7 , (6.27) 

From this we have, 

Now consider v* for t E [7}, 7}/]. 

/- -^=-v*(/-v-v*) = -v*(/-v*-Ce (/ )- (6.29) 

where C< 1. Since |/-v*(7})| = 0{e q ), from ((OS we can conclude that \f - v*\ < 0(e«) 
on [7/, 7}/]. 

Now we bound 7}/. We have for t E [7}, 7}/], 

9 = v(l-f-p + 0(e*)) (6.30) 
Plugging ( 16.28b into the relation directly above leads to 

v(0 = v(Z»exp[(l -f)(t- 7}) - ^log(l + eap(7})(f - 7})) + 0(e«f)] (6.31) 

Since p(7}) > 1 — /, a simple Taylor series argument shows, 

v(7} + ^ - O^ 1 )) < < v(7} + + ^(e^ 1 )). (6.32) 

So we can conclude that 7} + - 0(e' 7 " 1 ) < 7}/ < 7}+ ^ + O^ 1 ) and the lemma fol- 
lows. □ 
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6.3 Stage III 

From Lemma |6\2l we can assume 5(7}/) < 0, v(T n ) = e q , and \v*{T n )-f \ < 0(e q ). For 
convenience in this section set 5 = 5(7}/). 

Lemma 6.3. 7}// — 7}/ = <9(^fp)> 

\v{T m )-(l-p(T m ))\=0(e), (6.33) 



where, 



p(T in ) = p(T n ) + qe ^°f^ Ap IIIA + ^Ap ni . 2 + 0(e?\loge\ 2 ) (6.34) 



Ap// /j i=p(f//)(l-(2 + a)p(7}/)), (6.35) 
Ap///,2 =p(f//)(l - (1 + a)p(T u ))\og(f) - ap 2 (T n )\og(l~p(T n )). 



Proof. The proof of the following lemma is almost identical to the of Lemma 16. II The only 
difference involves the proof of (16.33b . Set Av = v ~(^~ p ) . Tnen direct computation gives, 

vv* 

Av = -vAv — — + p(v - ccp) (6.36) 



We define 7} , in the same manner as in Lemma 16.11 except that the role of v and v* are 
interchanged. That is, v(7}) = y/e, v*(72) = y/e. Essentially the same arguments we used in 
Lemma loTTl involving r(t) can be used here to show Av(72) < 0{-^). 

We now integrate (I6.36l l from T% to 7}//. First notice that in this range we have, 

A'v = -vAv - 0(—=). (6.37) 

From which we see that forf S [T2, 7}//], Av < (9(-^) and so v = 1 — p{Tii) + 0{yfe). Now we 
proceed to integrate (16.36b using the integrating factor exp[j|^ dsv(s)\. Since everything we 
do below is in terms of orders, we can replace this integration factor by exp[(l — p(Tu))(t ~ 
T2)]. This gives, 



ds?(s)exp[(l-p(Tn))(Tm-s)] + 0{l). 



Av(T m ) =0(Av(r 2 )exp[-(l - p{f n ))(f in - T 2 )}) (6.38) 
1 r T "' 
e Jt 2 

By the same arguments as in Lemma loTTI we can show v*(s) = 0(yfeexp[— 8(s— T^)]) and 
Tm-T z = ^l|loge|. Plugging all this into (|631 finally leads to Av(f// 7 ) = 0(1). □ 

6.4 Stage IV 

From the results of Lemma 16. 3 1 we can assume 5(7}//) < 0, |v(7}//) — (1 — p{fm))\ = 0(e), 
and v* = e q . 
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Lemma 6.4. Define Hjy as the solution of the following equation, 

(1 -f)H IV = Y^log (l + (1 + a)p(T n ,)(exp[H IV } - 1) j . (6.39) 
Then |(f/y - tin) ~^r\ = 0(e) and 

P(3Hr) - l + a+a^lexp^]-!) + 0{£) - (6 - 40) 



For f e [Tin, 7/v] we /iave, 



|v(0-(l-p(f)|<O(e) (6-41) 
v*(0=v*(7)//)expfe(f) + O(e 2 f)] 



w/zere, 



=- (1 -m-tm) + (1+ 1 a)£ log (l + (l + a)p(tiii)(exv[e(t-tni)] - 1) 

(6.42) 

+E( T^ir )exp ^ fffl) ^ (t)1 ' 

Proof. We start by considering v — (1 — />). Recall the definition of Av from Lemma 1631 By 
the same arguments as in Lemma 1631 we can show that Av < 0(1) and hence v = (1 — p) + 
0(e). However, to compute v* to 0(e) we need v to one higher order. To see why we need 
v to one higher order, suppose we have v = 1 — p + 0(e). Then we would have the estimate 
~v* = v*(p — (1 — /) + 0(e) and integrating this expression for 0(g) time units leads to 0(1) 
terms. To estimate v to 0(e) we integrate ( 16.36b and apply our apriori bound of Av = 0(1) to 
arrive at, 



where, 



Av(t)^(l~p(t))+I l +I 2 + 0(e 2 )(t~T III ), (6.43) 



h = eAv(0)exp[- f ds(\-p)] (6.44) 
JTm 

j dsexp[- [ ds' (1 - p)]p(l - (1 + a)p) 



h = e 



h is 0(e) for all t e [f///,f/v] since l-p(t) > 0(1). Consider 7 2 . We have p = ep(l - (1 + 
a)p) + 0(e 2 ). So we have, 

h= f dsexp[~ f ds'(l-p)]p (6.45) 
We now apply an integration by parts trick to obtain an expansion for l^. Indeed, since every 
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derivative of p earns us another £, integration by parts gives the following. 

h= f ds(l-p)exp[- f ds'{\-p)]-^ (6.46) 
J fm Js 1 ~ P 



exp|- / ds'(l-p)] Ph 



l-p(t) 
So we have the expansion 



l-p(s) 
+ 0(e z )(t-T n! ) 



+ 0(e z )(t-T IH ) 

s=Tin 



v-* = (l-p) + £ Pm ;[\^ Mt) h 0( £ -)( t -T ni ). (6.47) 

We now consider p. First note that in Stage IV we have, 

p = ep(l-(l + a)p) + 0(e 2 ). (6.48) 

Solving for p gives, 

Mf)= (l + (l + a)p(f /// )(exp[ £ (^f /// )]-l) +0(£ )(f - rm) (6 ' 49) 

Now we turn to v* . From the arguments above, by plugging our expansion for v into 

v * = v*(/— v — v*), we have 

v*(f) = v*(T ni )exp[g(t) + 0(e)}. (6.50) 

Recall that 7/y is defined by V*(7/y) = e" 7 . Considering g(f) and the definition of Hiv, a 
Taylor expansion gives, 

Tw-Tm = — + 0{e) (6.51) 
e 

and, 

v*(f/v) = v*(f /// )expfe(f /v ) +0(e)]. (6.52) 

□ 



7 The Stochastic System 

In this section we consider the stochastic system J2.ll ). Similarly to section|6] our goal will be 
to find asymptotic expansions for u at the end of each of Stages I-IV and develop estimates 
for the dynamics of u within the stages. In this section we also consider the initial stage. 
We recall from section [6] that 8(t) = p{t) — (1 — /) and we emphasize that 8 depends on 
the deterministic system ( 12.31 l and not the stochastic system ( 12. 11 1. Finally, in this section we 
will assume that e obeys the scaling of Theorem|2] The arguments in the case of TheoremQ] 
scaling are similar, and in fact simpler. 
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7.1 Initial Stage 

Recall that at t = we have, 

Set Av = v - v(0) and Av* = v* - v* (0). Then define T by 

T = M{t :v* <£ (0,e q ),\Av\ =e q ,oi\Ap\ = e q } (7.2) 

Note that T is not quite T s , but the lemma below will show that the two are equivalent in the 
V — > °° limit. 

We have the following lemma. 

Lemma 7.1. 

lim P(v* (T) = e q or v*(T) = 0) = 1. (7.3) 

lim P(v* (T) = 0) = exp[- 4(/ - °ff - /) j ] . (7.4) 

v^oo 1 v ' ' (k* + l)(l + a) 1 v ; 

limF(r< 2lQgV J = l. (7.5) 

Proof. Define two stopping times, 

T A = M{t : |Av| = or |Apj = e q } (7.6) 
r*=inf{f :v*^ (0,£ 9 )} 

Then clearly T = minjTA, T*}. We now consider the following stochastic system. Essentially 
this system is (12. U except that v*, v and p becomes fixed once v* exits (0,£ 9 ), |Av| > £ q , and 
|Av| > £ q respectively. % is the indicator function. 

dv=x(\v-v{0)\<e q )v{l-(v+v*)-p)dt (7.7) 



+Z (|v-v(0)|<^)J^±^^^ lW , 



dv* = X (v* < e q )v*(f~ (v + v*))dt + X (v* < e«)\/ V * ik * + ^ +V * )) dB 2 (t), 

dp = x(\p-p(0)\<e q )ep{v-p)dt 

Notice that in the system ( 17.71 i we are guaranteed |v — v(0)| < £ q , \p — p(0)\ < e q , and v* < e q 
for all time. Define ?a and % analogously to Ta, and notice that T = min{7A, %} since up 
to time T the systems d2.U and ( 17.7b evolve identically. 

We now consider f*. To control v* we bound it from above and below by two diffusions 
that are simpler to analyze. Set, 



</f,(/ ) = v A U - v(0)ul, + \ r 1{k * + *l +P + " )) dB 2 (t), (7.8) 



dv B (t) = v* B (f-v(0)-e q )dt + x ^ ^ B 2 ft), 



i^r + vl+P+v)). 
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and notice that the drift term in v* is less than and greater than the drift terms of v* A and v* B 
respectively. By pathwise uniqueness of solution we have v% < v* < v* A pointwise for all 
t < f* on the probability space of the two Brownian motions Bi,B 2 10 ■ F° r an y t we have 
the following bound, 

P{% <t)> P(v* A (t) = 0) +P(v* B (t) > e«). (7.9) 

We would like to run and v* B long enough to ensure that either these diffusions will be 
absorbed or reach e q . If we ignore the variance terms in ( 17. 8t we would find that v* A (t) = E q 
for t /--v(o) l°gV. With this in mind, we set i = jz^7q] log V and proceed to examine v A (i) 
and v* B (i). We consider first P(v* A (t) = 0). Set x A = Vexp[-(/ - v(Q))t]v* A , then 



dx A = y/x A (k* + v(0) + 0{£i)) exp[-(/ - v{0))t]dB 2 (t). (7.10) 
We now perform a time change, Wji(t(Y)) = x A {t) with 

Tl(0 = (F + l + O(e«))exp[-(/-v(0))?]. (7.11) 

This leads to 

dw = y/wdB 2 , (7.12) 

with w(0) = 1. Note that w is a weak solution to ( 17.12b since w is not B 2 measurable. How- 
ever, we only care about the distribution of w(i ), and by weak uniqueness w(f ) will have the 
same distribution as the strong solution of ( 17.12b lfl2Tl . 

Recall we are interested in P(v A (t) = 0). Let t A = ?k(f ). Then, 

P(v A (i)=0)=P(x A (i)=0)=P(w[t A ]=0). (7.13) 

Explicit integration gives 

1 k* + 1 e q 

^^T-^ + ^^TT^ (7 - 14) 

2 /-v(0) /-v(0) 

Then standard results, see J2l P- 260, give 

Pm) = 0) =P{w[t A ] = 0) = exp[-^] = exph 4 ^"^ ^ ] + 0(e«). (7.15) 

Now we consider P(v* B {i) > £ q ). We perform the same series of transforms as we did for 
except that now we have, 

x B = Vexp[-(/- v(0)-2e'')f]vj, (7.16) 
^) = ( ^ + 1 ' Q(£g) )ex P [(/-v(0)-2^]. 

Setting tg = Tg(f) we can compute, 

t B =t A {l + 0(j^-)). (7.17) 

We can follow our transforms to find the following, 

P(v* B (t)>e c >)=P(w[t B }>a) (7.18) 
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where a = exp [(/-v(o) V o(^))fj ■ Plu gg in g in our scaling for e gives a = Q( v(lo [ v)? ). All this 
leads to, 

lim P(w[t B ] >a)= lim P(w[t A ] ^ 0), (7.19) 

v — >°° v — 

If we now examine the dynamics of v and p, the same type of methods that we applied to v* 
show that Ta > O(logV) with probability approaching one and ( 17.4b follows. To see ( 17.5b 
notice that, 

lim P(T < i) = lim P(w[t A ] = 0) + lim P(w[t B ] > a) = 1 (7.20) 

V — »oo V — >oo V — >°° 

□ 



7.2 Stage I and III 

We assume that u(T s ) = u(T s ) and that u(T s ) satisfies the conclusion of Lemma l7.ll Then we 
have the following result which covers stage I behavior. An identical analogous result exists 
for stage III behavior. 



Lemma 7.2. 



P( sup ||„(,)- ii ( t )||.>-L)< 0( C°^ V )>^ ) (7.2!) 

T s <t<T, V* 



Proof. Set D = -\ and define 



where 



u — u 

E = (7.22) 

D 

We have, 

a W-a(a) 

«(») = ( v " ( v*(7-"v-^f \aiu) = ( ) (7 . 24) 

Let r be a stopping time defined as T = min{f > T s : ||-E(f)|| > 1} an d E T (t) = E(t A T). 
We will show that T » | log e | with high probability and this will lead to T 3> 7} with high 
probability. Employing a Taylor expansion, we have for all T s <t <T, 

dE T {t) = {Va(u)E T + 0(D))dt + G ( U t?^ dB (7.25) 



Using the integrating factor exp[— J dsVa(u)] we can arrive at 



tf\T 



E T {t)=l dssxp[J ds'Va(u(s')))0{D) (7.26) 
dB(s)exv[J ds , Va(u(s r ))} [ ; 
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Simple computaiton gives 

/ (1 — v— v* — p)— v —v —v 

Va(«)=( -v* (/-v-v*)-v* J (7.27) 

\ ep -2ep ' 

Since v,v*,p are bounded we have ||Va(fi)|| < 1. Using this bound in ( |7.26t along with a 
standard martingale argument gives, 

E[ sup £ 2 (fAr)]<0(D 2 (|log £ |) 2 e X p[2|log £ |]) + |log £ | eXp [ 2 ''° 2 g£|] (7.28) 

f<C| loge| 

Recalling that £ = 0(tXv) gives, 



E[ sup ^( f A r )l^( (lQg ' 0g ? 0gV)2 ). (7.29) 

r<C|loge| W 

A simple Chebyshev argument now gives, 

P( sup \\(u-u)(tAT)\\„ >\)< ( ^ l0 ^y°^ 2 ) (7.30) 
r<C| loge V? 

But then by the definition of T we may replace t A T by f in the expression directly above. 
Finally we note that 7} = 0(|loge|) and 5(f) = 0(e ?+1 ) in time 0(|loge|). Using ( I7.301 l we 
see that 7} = <9 ( | log e | ) and so we may replace the term sup r<C | i oge in ( 17.301 ) by sup f ^ r; . □ 



7.3 Stage II 

Now we consider the time interval [7}, 7}/]. This interval is where stochastic effects become 
important and v diverges from v. Note that from Lemmas [6.1| and |7~2l we have \u — u\ < 0(~t) 

outside of a set with O(-V) probability. As a consequence, 

|2h7 -7/7/1=0(^-5-) (7.31) 

v(7» = e* 
|/-v*(2»|<0(e«) 
5(7}) > 0. 

We define the function (j)(t) = J^<is(l — /) — p(s). Notice that <j) depends on p(Tj) although 
we mostly suppress this dependence. By the results of Lemma l672l we can integrate p to arrive 
at 

<j)(t) = (1 - f)(t- 7» - J- log(l + eap{T,){t - 7») + 0(e«) (7.32) 

Since p is strictly decreasing in Stage II, has a single critical point which we set as ^ (as 
mentioned in section|5]l. We have <j>'(^f) = (1 — /) — p(^jr) = 0. Since we can solve for p on 



27 



Stage 77, we can solve for Sn- In fact, 



Jg = er/+ «(i-%(r f ) +Q ^ (733) 



0( — ) = 

e ae 



■0{f), 



f(-^) = ea(l-f) 2 + 0(e«), 

We split the interval [7}, 7}/] into three pieces using the times fo = ^ — and f i = ^ + ^ 
where m can taken any value between j and |. A consequence of our arguments will be 
7} < fo < 1 1 < 7}/. The following lemma shows that in the time interval [7}, to], u stays close 
to u. 

Lemma 7.3. Outside of a set Q. such that P(Q.) < e q ~ z we have for any t £ [7},fo], 

\v(t)-v(t)\<ev(t). (7.34) 

|v*(0-v*(r)|<£^ (7-35) 
\p(t)-p(t)\<e*£. (7.36) 

Proof. From Lemmas |6 . 1 1 and [7721 we see that ( I7.34b - d7.36b hold for t = T[. In Lemma 17721 
we were able to scale u — u by a constant D. Here, things are not so simple. Set r — and 
define, 

v—v 

Wi 

(7.37) 



V —V 

8 r 
P-P 



where h(t) = v(7})exp[0(r)]. Note h(t) = v{t){\ + 0{e q - 1 )). 

We write E = (Ei,E 2 ,E 3 ) and set T = inf{f : ||i?(f)||<» > £,t < 7}/}. Now, notice that 
the coordinates of a given in ( I7.241 i are all second order polynomials and so an exact second 
order Taylor expansion exists for a. Using this Taylor expansion gives at time t <T, 

dE x = gi {t)Ex + 0(e r ) + -^v^fii (7.38) 
h(t)y/V 

dE 2 =g 2 (t)E 2 + 0{e r ) + ^^LdB2 (7.39) 

dE 3 =g 3 (t)E 3 + 0{e r ). (7.40) 

where 

f -f( t ) + (l_v*-v-p) v 

g{t)=( (f~v-2v*) (7.41) 
v -2pe ' 

We would now like to use g(t) as an integrating factor. Notice first, 

g(t)=( -f + 0(e«) (7.42) 
v 0(e) J 
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Set Gi(s,t) = exp[/ s ' ds'gi(s')]. Then we can integrate ( 17.381 1 using G and arrive at, 

ftAT rtAT a ( u ) 

E l {tAT) = dsG l (s,tAT)0(e r ) + dBi(s)G\(s,t AT) u ' (7.43) 
Jt, Jt, h(s)VV 

E 2 (tAT) = dsG 2 (s,tAT)0(e r )+ dB 2 {s)G 2 {s,t AT)^jL (7.44) 



7> eiVV 

rtAT rtAT 

E 3 (tAT) = / dsexp[ ds'g 3 (s')]0(£ r ). (7.45) 

J T] Js 

Initial condition terms involving E(Tj) that should appear in the expressions directly above 
are of lower order, so we have suppressed them for simplicity. From the above relations we 
find directly £3 (f A T) = 0(e r ~ l ). Tobound£i we consider second moments. 



E[jup El{t'AT)] < 0(e 2 ^)+E[( ' dB(s)J V{l ^ ^ p) ) } 



T,<t'<t 



(7.46) 



where we have used the fact that Gi(0,f') = 0(1). Forf < T, we have a(v) =0(v) = 0(h{t)). 
Using this observation and standard Martingale arguments gives, 

t , 1 [l AT dsexp\— (b(s)] 

E[ sup E\{tAT)]<0{e^) + 0{ — ^ FL ^ WJ ) (7.47) 

T,<t'<t n 1 !) v 

The situation is simpler for E 2 . We use (J 2 (u) < 1 and arrive at, 

£[sup£ 2 2 (r'Ar)] < (e 2r )+0(-^). (7.48) 

Using our moment bounds on E\,E 2 and pointwise bound on £3 we now bound E(t) and 
hence remove our restriction of t < T outside a set of small probability. Indeed first consider 
Ei, By a Chebyshev bound, 

P( sup >^) = 0(£"- 2 )+/ (7.49) 

T,<t'<t AT 1 

where 

..r^stm (7 .5„, 

Jt, v(7»£ 2 V 

Recall that v(f/) = e q . We consider / by performing a Taylor series expansion of <j> about 



I<^£dse*p[-m (7.51) 



1 r s "~ e , ,s„ 



dsexp[— <j)(— )] 



W*h ^exp[^(-)]0(exp[ ( ___ ) 2 ]) 
-5 — 0(exp[-0(-^)]). 
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Now, (j) is a function of p(Tj) for which we have an explicit asymptotic expression. Us- 
ing the expansions of Lemma loTI we can express p(Ti) in terms of p(T s ) (the difference 
is 0(e|loge|)). Applying a Taylor series argument for 0, we can then compute <j>(^r) in 
terms of p(T s ). Recalling that p(T s ) — + 0{e q ) we can arrive at the relation between 
0ii m and (j) given in (15.7b . Plugging this into ( 17.5 U and using the scaling of Theorem [2] 
gives / = 0(exp[— 0{-^)\). Bounds for £2, £3 are straightforward and we can conclude 
P(T < t ) < 0{e q - 2 ). □ 

Now we consider the interval [?o,fi]- This interval is where stochastic effects become 
important and v diverges from v. 

Lemma 7.4. If \ < m < § then outside of a set Q. with P(Q) < 0(e 2 ) we have, 

sup \ P (t)-p(t)\<0(e^) (7.52) 

te[to,n] 

sup |v*(f) — v*(r)| >0(e*r) 



Set, 



Then, 



>=■// - 



exph0(^)l 

( Y B " ) (7.53) 



a{\-ff K Vv(7»Ve 



^ S " = ^^ )K ' (7 ' 54) 
- w[v^(*+l)S ff ]v(lb)(l + 0(e 2 - 3m ), (7.55) 



ami 



1 r\flK 1 

OS — - =S// ds n r>(„2-3 



-—-Al+0{e l - im ) (7.56) 

1 



Vv(s) Jo w[(k+l)S n s] 
Proof. We first define a stopping time T as follows, 

r =ti Ainf{f > f : v(t) > £ c/ or |v*(f) - f\ > e q }. (7.57) 

Letz(f) = Ezl and recall from Lemma|73]thatr= 2+1. Thenz(f AT) = -e(p + p)z+0{e r ) 

and we have z(0) < 0(e r ). We can conclude \p — p\ = 0(e r ) for t £ [to,T]. A similar 
argument shows |v* — v*| = 0(e r ) on [to,T]. 
Now we turn to the dynamics of v(t A T). 



dv = vWit) + o^))dt + ffi+f+pM+oW) ^ (7 . 58) 

Next, we linearize <j)'(t) and p(t) about f-. Recall f (S) = and p( s f) = (1 -/). 



,/,- = ,-, «j,"(^)(r - ^) + O( £ ^'- m 0)^+ y 1} + ^ £l ~ m)) dBi (7.59) 
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Now we control v with the same techniques used in Lemma 17711 Define diffusions v A) Vb on 
the Brownian motion space produced by B\ ,Z?2 as follows, 



.ifi! (7.60) 



and 



affii (7.61) 



where C is an (9(1) constant. Then up to time T, vb < v < v A . We will show that vb(T) — > 
VA(r) in distribution and this characterizes v(T). Consider v A . We proceed as in Lemma l7TTl 
First we define, 

Xa (j) = ^) e xp[-0"(^) fds(s~^)+Ce 2 ^-'% (7.62) 



v(*b) 

Then define 

w A (T A (t))=x A {t) (7.63) 

with 

Under these transformations, w A is a weak solution of the following Feller diffusion. 

dw A = y/wXdBi. (7.65) 

We would like to integrate ( 17.64b to obtain T A . This will be made simpler with the following 
formula for v(?o) in terms of v(7}). 

v(f ) = v(7»exp[0(r o )](l +0(e)) (7.66) 
= v(7»exp[0(^) - <j>»(f) ^) +0(e 2 - im ))]- 



Integrating (17.64b and applying the above relation gives the following, 
T A (O=exp[-0(— )] jf A — ^ i e xp[0"( — ) J^ds{s-—) + 0{e 2 im )] 



(7.67) 



/ exp[-0(f)](^+l)) w 1 



' ^ 2 )«&h/e«(l-/) 2 (r-^) (l+Q(e 2 - 3m )) 



where <I>(f) = f_ 00 iisexp[— £■]. 

With d7.66t and ( |7.67t we can compute v^(f). We have, 



v A (0=w[T A (0]v(r )exp[f' (*/,) f ds{s -*-?-) + 0(e 2 -* m )] (7.68) 

■/«o e 

= w[T A (f)] v(r )exp[ea(l -/) 2 f <M*~ -)](1 + 0(e 2 - 3 '")). 

7(0 e 
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The results for vg are identical and so we can conclude v(t AT) = Va(? AT)(1 + (9(e 2 ~ 3m '). 
As described in the proof of Lemma l7.3l a Taylor series argument on <j) using the expansions 
for p developed in Lemma |6T] allows us to derive J5.7I ). Using this formula and our scaling 
for e gives ( 17.541 i. 

Now we eliminate the T dependence of our result. Since VA(t) = v(to)w[x(t)] and t(?) 
() \ we can conclude that VA(t) <C £ q with high probability. More precisely, through a 
Chebyshev inequality we have 

P{ sup v A {t) > e*) < P(w[x(t)] >^t)< P(w[T(t)} > 0( exp [-0(-L)])) < (e 2 ). 

(7.69) 

( 17.55b now follows by plugging in t = t\ in ( 17.68b . and ( 17.56b follows by using ( 17.681 ) in 
the integral ds^pj and applying the substitution, s — > Ta(s). □ 

Finally assuming that v{t\ ) ^ 0, we consider the dynamics of v on [t i , 7}/] . 

Lemma 7.5. Assume v(t i ) = r\v(t(f)for some r\ > 0. Then for t S [?i , 7}/] outside a set £2 with 
P(£l) < S q ~ 2 we have, 

\v*(t)-f\<0(e^) (7.70) 

pCT//) = />(?>/) - ea 2 P f (r -L , logT? + 0(e 2 ). (7.71) 
1 - J - PV l u) 

Proof. After time fi the system returns to deterministic behavior. By the same arguments 
used in Lemma [731 we have, 

\v\t)-v\{t)\<e^ (7.72) 
\p(f)-pi{t)\<e*T. (7.73) 

In Lemma 17731 we showed that in [7},fo], v is well approximated by v. The same holds in 
[ti,Tjj], except that now we must restart the deterministic system so that v(t\) — v(?i). With 
this in mind we can apply the arguments of Lemma 17. 3 1 to justify the following relation. 

v(0 = tjv(?i ) exp[0 (?) - <j>(h) + 0(e*r)] {1.1 A) 

Then if we use the arguments of Lemma l7T3l in which we prove |v — v| < ev we can argue as 
follows, 

v{t) = r/v(0(l + O(e))exp[-(0(rO - 0(f o ))] (7.75) 
= nv(t)(l + 0(e)) e x V [-U"(s I1 )((t 1 -s n ) 2 - (s„-t ) 2 )\ + 0($"'(s II )(t 1 -? ) 3 )] 
= riv(t){l + 0(e 2 - 3m )) 
Now we consider 7}/ in comparison to 7//. Perturbing off of 7}/ gives, 

v(7}/ + Ar) = tjv(7>/)(1 +0(e 2 - 3m ))exp[(l -f)At- ^log(l + aep(T n ))At)] (7.76) 
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A Taylor expansion on Af leads to 

v(T n + Af ) = 77 v(T n ) (1 + 0(e 2 - 3k )) exp[(l - /)Af - p(T n )At + 0{At 2 ) ) ] (7.77) 



We want to find At such that v(7}/ + Af ) — e q since then we will have 7}/ = 7}/ + Af . Solving 
using ( 17.77b gives, 

Af = -(^y)log77 + 0(e 2 ) (7.78) 

Our main interest in determining 7}/ is our need to compute an expansion for p(Tjj). We 
have, 

p(T„) = p(T I1 )+p>(T II )j + 0(e 2 ) (7.79) 
= p(T n )-e^^)\ogi 1 + 0( £ 2 ) 

□ 

7.4 Stage IV 

Stage IV is similar to Stage EL As in Stage H, using Lemmas 16.31 and 17721 we have outside of 
a set of vanishing probability, 

\T m -T ln \<0{-\) (7.80) 

\v(T m )-(l-p(T m ))\ = 0(e), 
v*(T u ,) = e cl 
5(7«/) < 0. 

We set y/(t) = fQdsp(s) — (1 — /). Then y/ has a single critical point which we label ^j-. In 
Stage II we define f = ^ — -pg and fj = ^ + In this section, for Stage IV we define 

~~£ £™ 1 - T" + Pf' 

The proofs of Stage IV are almost identical to those of Stage II. We state the analogues 
of Lemmas |7 . 3 1 and [731 since the proofs follow identical arguments. In Lemma [7771 we only 
keep 0(1) terms for the p(T/v) because we do not need the 0(e) accuracy in the later cycles 
that we need in the first cycles. 

Lemma 7.6. Outside of a set £2 such that P(£l) < 0(s 2 ) we have for any t € [7///,^], 

\v(?)-y>(t)\<e*£. (7.81) 

|v*(r)-v*(0l<ev*(0 (7-82) 

\ P (t)-p(t)\<e^. (7.83) 
Lemma 7.7. Suppose v*(t*) = t?v*(?q). Then fort E [fi,7>y], 

\ v (t)-(l- P (t))\<0(e) (7.84) 
P (T IV )=p(T, v ) + 0(e) 
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Now we turn to the analogue of Lemma 1741 The arguments are essentially the same, but 
we consider y/ instead of (j). We have, 



v( — = - 

e e 



" £7}//+log[ (/-a(i-/))p(r m ) ] ' (7 ' 85) 

1 g( i-(i + g )p(7>„) ) _ (i _ /)ipg( (i- / )(i-(i + a)p(7> ff )) ) - 



1 + a bV /-a(l-/) ' v J ' by {f-a{\-f))p{T m ) 
W"( s -f) = e(f-a(l-f)) + 0(e 2 ). 

Then the analogue to Lemma 17.41 is the following result. We do not need the expression 
f t i ds ^/ s \ because the lineages of the mutant type must coalesce to the original mutant cell 
by time zero. 

Lemma 7.8. If \ < m < I then outside of a set Q. with P(£2) < 0(e 2 ) we have, 

sup \p(t)-p(t)\ <(9(e 2 r i ) (7.86) 

fe[«b,fi] 

sup \v*(t)-v*(t)\>0(e*¥) 

Set, 



1 ,exp[-f(^)l 

-( _,- , V ) ( 7 - 8? ) 



(1 -/)(/- a(l-/)) v Vy(f///)V^ 



Then, 



fcH ('V«(.-j)) ( Ms£g^) (7 - 88) 

(recall the definition of V\jy in \3.9\ from Theorem^, 

v*(ti) = vv[^(F +/)S/v] ( 1 ~ /? / (7>//) )exp[-g(f /// )]r(f )(l +0(£ 2 - 3 '")) (7.89) 

The expression T7/y in (17.881 1 requires explanation. If the wild type is not lost, then it 
experiences a stochastic perturbation in Stage II. As Lemmas 17.41 and 1731 show, this pertur- 
bation is 0(1) and influences p(Tu) by 0(e). The 0(e) perturbation on p(Tu) is integrated 
over Stage IV which is of duration 0(j) and so the perturbation has an 0(1) effect on v* 
during Stage IV. This is where r\iv comes from. More specifically, when we expand y/ in 
Taylor series to obtain an expansion in terms of y/ii m , the 0(e) term in ( 17. 7U in Lemma [731 
is responsible for T]/y. 



Appendix 

Here, we explain the rescaling of section [2] precisely. We introduce dimensional constants 
V, P, T for the units of infected cells, CD8 cells, and time respectively and define the non- 
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dimensional variables v, v*,p by setting 

v(Tf) - v*(Tf) p(lf) . f 

v(*) = -yA v*(f) = -^, p{T) = ^ ¥ L, t = -. (A.l) 

We can then hope to approximate the birth-death process through the following SDE. 



,/f = Tr|Ait- t -v, ( ,ff 7 -, „/,- + JW(k+cV(v + v*) +aPp) ^ i(f)) (A.2) 



<h'- : 'TV' (AA r""'(f - r ) ),//' - \j V * T( - k * + ^ + V *^ dB 2 (t) 



dp = Tp(bVv-dFp)dt+ f T{h + b l" + d ^ P) dB,(t). 

We choose T, V, and P so that AAT = 1, cVT = 1, and aPT = 1. Plugging this into (|A~2| > 
gives 



dv - v(l - (v + v~*) J V(fc+(V ^ V * )+P) ^i(0, (A.3) 



d* = v*{f- {v+v*))dt + J V * {k * + ^ +V * )) dB 2 (t), 



~,~ ~ \ ,~ ep(h + v + ap) ,„ ... 
dp = ep(v - apjdt + y — — —dB^t). 

where, 

A - A* d 
A= — -, A* = — -, e = -, « = -. (A.4) 
AA AA c a 



We assume that the coefficients in dA.3b are all (9(1). This will be true if V,P are on the order 
of the infected cell and CTL population counts and the system is assumed to vary on that 
scale. We now drop the tildes and, for simplicity, the variance terms in the p equation. This 
gives ( 12.11 ) 
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